Towards Deep Modeling of Music Semantics using EEG Regularizers

Authors

  • Francisco Raposo
  • David Martins de Matos
  • Ricardo Ribeiro
  • Suhua Tang
  • Yi Yu
Abstract

Modeling of music audio semantics has previously been tackled by learning mappings from audio data to high-level tags or to latent unsupervised spaces. The resulting semantic spaces are theoretically limited, either because the chosen high-level tags do not cover all of music semantics or because audio data alone is not enough to determine music semantics. In this paper, we propose a generic framework for semantics modeling that focuses on the perception of the listener, through EEG data, in addition to audio data. We implement this framework using a novel end-to-end 2-view Neural Network (NN) architecture and a Deep Canonical Correlation Analysis (DCCA) loss function that forces the semantic embedding spaces of both views to be maximally correlated. We also detail how the EEG dataset was collected and use it to train our proposed model. We evaluate the learned semantic space in a transfer learning context, by using it as an audio feature extractor in an independent dataset and proxy task: music audio-lyrics crossmodal retrieval. We show that our embedding model outperforms Spotify features and performs comparably to a state-of-the-art embedding model that was trained on 700 times more data. We further discuss modifications to the model that are likely to improve its performance.
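
The correlation objective mentioned in the abstract can be illustrated with a small numerical sketch. The code below is not the authors' implementation: it assumes the commonly used formulation of the DCCA objective, in which the loss is the negative sum of canonical correlations between the two views' embeddings, and it operates on plain NumPy arrays rather than on network outputs. The names `cca_loss`, `inv_sqrt`, and `reg`, the regularization value, and the toy data are all hypothetical choices made for illustration.

```python
import numpy as np


def inv_sqrt(mat):
    """Inverse matrix square root of a symmetric positive-definite matrix."""
    vals, vecs = np.linalg.eigh(mat)
    return (vecs * vals ** -0.5) @ vecs.T


def cca_loss(h1, h2, reg=1e-4):
    """Negative total canonical correlation between two views.

    h1, h2: arrays of shape (n_samples, dim), one row per paired example
            (e.g. an audio embedding and an EEG embedding; assumption).
    reg:    small ridge term added to the covariances for stability.
    """
    n = h1.shape[0]
    h1c = h1 - h1.mean(axis=0)
    h2c = h2 - h2.mean(axis=0)
    s11 = h1c.T @ h1c / (n - 1) + reg * np.eye(h1.shape[1])
    s22 = h2c.T @ h2c / (n - 1) + reg * np.eye(h2.shape[1])
    s12 = h1c.T @ h2c / (n - 1)
    # Singular values of the whitened cross-covariance are the
    # canonical correlations; their sum is the quantity to maximize.
    t = inv_sqrt(s11) @ s12 @ inv_sqrt(s22)
    return -np.linalg.svd(t, compute_uv=False).sum()


# Toy data standing in for the two views (purely synthetic).
rng = np.random.default_rng(0)
view_a = rng.standard_normal((2000, 4))
mixing = 2.0 * np.eye(4) + 0.1          # invertible linear map
related = view_a @ mixing               # linearly related second view
unrelated = rng.standard_normal((2000, 4))

print("related views:  ", cca_loss(view_a, related))
print("unrelated views:", cca_loss(view_a, unrelated))
```

In this sketch the loss for the linearly related pair should be close to the negative of the embedding dimension, while the loss for the independent pair should stay near zero. In a 2-view network, a loss of this form would be computed on the outputs of the two branches and minimized by gradient descent, which requires a differentiable framework rather than NumPy.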

Similar resources

Neural Correlates of Boredom in Music Perception

Introduction: Music can elicit powerful emotional responses, the neural correlates of which have not been properly understood. An important aspect about the quality of any musical piece is its ability to elicit a sense of excitement in the listeners. In this study, we investigated the neural correlates of boredom evoked by music in human subjects. Methods: We used EEG recording in nine sub...

Fusion of electroencephalographic dynamics and musical contents for estimating emotional responses in music listening

Electroencephalography (EEG)-based emotion classification during music listening has gained increasing attention nowadays due to its promise of potential applications such as musical affective brain-computer interface (ABCI), neuromarketing, music therapy, and implicit multimedia tagging and triggering. However, music is an ecologically valid and complex stimulus that conveys certain emotions t...

Classifying music perception and imagination using EEG

This study explored whether we could accurately classify perceived and imagined musical stimuli from EEG data. Successful EEG-based classification of what an individual is imagining could pave the way for novel communication techniques, such as brain-computer interfaces. We recorded EEG with a 64-channel BioSemi system while participants heard or imagined different musical stimuli. Using princi...

Combination of Beamforming and Synchronization Methods for Epileptic Source Localization, using Simulated EEG Signals

Localization of sources in patients with focal seizures has recently attracted much attention. In severe cases of focal seizure, neurosurgery may be performed to remove the defective tissue. The success of this major operation depends entirely on the accuracy of source localization. To increase this accuracy, this paper presents a new weighted beamforming method...

A hybrid EEG-based emotion recognition approach using Wavelet Convolutional Neural Networks (WCNN) and support vector machine

Deep learning and convolutional neural networks (CNNs) have become widespread tools in many biomedical engineering studies. A CNN is an end-to-end tool that integrates the processing procedure, but in some situations it needs to be combined with other machine learning methods to be more accurate. In this paper, a hybrid approach based on deep features extracted from Wave...

Journal:
  • CoRR

Volume abs/1712.05197

Publication date 2017